CDS

Accession Number TCMCG040C40115
gbkey CDS
Protein Id RDX74742.1
Location complement(join(16049..16279,16367..16510,16598..16685,17537..17642,18406..18560,18876..18937,21521..21821,23286..23353,23574..23677,23748..23924,28959..29096,30050..30233,30322..>30663))
Gene SPPA
Organism Mucuna pruriens
locus_tag CR513_45472
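
The Location field can be checked against the protein length reported in this record. The sketch below is only an illustration, not part of the record: it assumes the usual INSDC conventions (1-based inclusive coordinates, `>` marking a partial end), and the helper names `parse_segments` and `spliced_length` are hypothetical.

```python
import re

# Location string copied from the Location field of this record.
LOCATION = (
    "complement(join(16049..16279,16367..16510,16598..16685,17537..17642,"
    "18406..18560,18876..18937,21521..21821,23286..23353,23574..23677,"
    "23748..23924,28959..29096,30050..30233,30322..>30663))"
)


def parse_segments(location):
    """Return (start, end) pairs found in an INSDC-style location string."""
    # Drop the partial markers '<' and '>' so only the coordinates remain.
    cleaned = location.replace("<", "").replace(">", "")
    return [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", cleaned)]


def spliced_length(segments):
    """Total nucleotides across all segments (coordinates are inclusive)."""
    return sum(end - start + 1 for start, end in segments)


segments = parse_segments(LOCATION)
nt = spliced_length(segments)
# Subtract one codon for the stop codon to get the residue count.
print(len(segments), nt, nt // 3 - 1)  # 13 2100 699
```

The 13 segments sum to 2100 nt, that is 700 codons including the stop, which agrees with the 699 aa Length given in the Protein block.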

Protein

Length 699aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA414658, BioSample:SAMN07821433
db_source QJKJ01010076.1
Definition Serine protease SPPA, chloroplastic, partial [Mucuna pruriens]
Locus_tag CR513_45472

EGGNOG-MAPPER Annotation

COG_category OU
Description protease
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000, ko01000, ko01002
KEGG_ko ko:K04773
EC -
KEGG_Pathway -
GOs -

Sequence

CDS:  
ATGTCACGCACTCGCATTGGCGTTCACCGCTTCCGCTACAGCTACACGGCATTATCGTCAATCCTCTCTCCACCACCATCATCCTCCTATTCAATCACTCGCTTTCAGTTCCAATGTCCCCAACAATATCTCACGCGCGCACCCCTCCCCTTCCACTACCACCCTCCTCGTTATTATTCATCCACCTTAGTCGGTGGCGATGAGCATTATCCCACTGGAGACTTCGATTTCAAGCCCGTCACAGGCTGGAAAAATTTCATTCTCAAGCTCAAGATGCTAACAGCCTTCCCCTGGCAGCGTCTCCGATACGGCACCGTCTTGACAATCAAGTTGCGCGGCCAGATTCCGGATCAGCTGAAGAGTAGGTTCTCCTCGGGATTATCTCTGCCTCAAATCTGTGATAATTTCTTGAAGGCAGCCTACGATCCTCGAATTTCCGGCATCTATCTCCATATTGATATTTTAAATTGCGGCTGGGCCAAGGTCGAAGAAATTCGAAGGCATATCTTGAATTTCAGGAAATCAGGAAAATTTATTGTGGCTTATGTCCCTTCATGTCGAGAAAAAGAATATTATATTGCGTGCGCATGTGAAGAGATTTATGCTCCTCCAAGTGCTTATTTTTCTTTGTTTGGATTGACTGTTCAAGCCCCATTCCTCAGAGGTGTTTTAGAGAATCTTGGAATTGAACCACAGGTGGAAAGGATTGGGAAATACAAAAGTGTAGGTGATCAACTAACCCGTAGAACCATGTCTGAAGATCATCATGAGATGCTGAATGCTTTGCTTGATAATATCTATGCAAATTGGCTGGATAAAGTCTCTTCTGCTAGAGGTGCAAGAAAAAAAAGAGAAGATATTGAGAATTTCATAAATGAAGGTGTCTATCAAGTAGAGAAGCTTAAAGAAGAGGGCTTCATATCAGACATACTCTATGATGATGAGGTTATCGCTAGGTTGAAGGAGAGACTTCAAGTTAAAACAAATAAAGATCCGCCTATGGTTGATTACAGAAAATACTCTAGAGTTAGGAAATCAACTCTTGGACTATCAGGTGGTAAAGAATTAATAGCCATTATTCGAGCTTCGGGGAGTATTCGTCGTGTCGAGGGTCCATTAAGTTCCCGTAGCTCAGGTATCATTGGAGAGAAGTTCATTGAGAAGATACGCAATGTAAGAGGTACACTCCAGCATATAAAGTTAACTTCTTTGGAACGGCACGTAATTAGTGAAATGTTAAAGTTGTCTAGATTGGAGTTACATTATTCACTATTGGTAGCAACAATTTGCAACAACCTTATCTTGCTGAGTGAGAAATATAAGGCTGCTATTATCCGAATTGACAGTCTAGGAGGTGATGCCCTTGCTTCTGATTTGATGTGGAGAGAAATCAGGCTTCTGGCTGCCGCAAAACCAGTCATTGCTTCAATGTCTGATGTGGCAGCAAGTGGAGGGTACTACATGGCAATGGGAGCAGGAGTTATTGTTGCAGAGAGTCTTACCTTAACTGGTTCAATTGGAGTGGTCACAGGAAAACTTAACCTTGGGAAGCTTTATGAGAAGATTGGCTTCAACAAAGAAATTATATCAAGGGGTAGATATGCTGAGCTCCGGACAGCTGAACAGCGTTCTTTTAGACCAGATGAAGCAGAGCTATTTTCCAAGTATGCGCAGCATGCTTATAAACAATTTCGAGATAAGGCAGCTTTTTCCAGATCTATGACTGTAGAAAAGATGGAAGAGGTTGCACAGGGAAGGGTTTGGATTGGTAAGGACGCAGCTTCTCATGGTTTGGTTGATGCTATTGGCGGCCTTTCTCGAGCTGTTGCCATAGCAAAATTGAAGGCCAATATACCTCAAGACAGACAGGTTACCGTTGTGGAGCTCTCGAGACCCAGCCCTTCACTACCCGAGATTTTTAGTGGTCTAGGTAATTCTCTCGTTGGAGTAGACAGAACCTTAAAGGAGTTACTTCAAGACTTGACATTTTCCCATGGAGT
CCAAGCACGAATGGACGGAATCGTGTTTGAGAAACTGGAAGGATATCCATACGCCAATCCCATTTTTGCATTGATGAAAGATTATCTTAGTTCTCTGTAG
Protein:  
MSRTRIGVHRFRYSYTALSSILSPPPSSSYSITRFQFQCPQQYLTRAPLPFHYHPPRYYSSTLVGGDEHYPTGDFDFKPVTGWKNFILKLKMLTAFPWQRLRYGTVLTIKLRGQIPDQLKSRFSSGLSLPQICDNFLKAAYDPRISGIYLHIDILNCGWAKVEEIRRHILNFRKSGKFIVAYVPSCREKEYYIACACEEIYAPPSAYFSLFGLTVQAPFLRGVLENLGIEPQVERIGKYKSVGDQLTRRTMSEDHHEMLNALLDNIYANWLDKVSSARGARKKREDIENFINEGVYQVEKLKEEGFISDILYDDEVIARLKERLQVKTNKDPPMVDYRKYSRVRKSTLGLSGGKELIAIIRASGSIRRVEGPLSSRSSGIIGEKFIEKIRNVRGTLQHIKLTSLERHVISEMLKLSRLELHYSLLVATICNNLILLSEKYKAAIIRIDSLGGDALASDLMWREIRLLAAAKPVIASMSDVAASGGYYMAMGAGVIVAESLTLTGSIGVVTGKLNLGKLYEKIGFNKEIISRGRYAELRTAEQRSFRPDEAELFSKYAQHAYKQFRDKAAFSRSMTVEKMEEVAQGRVWIGKDAASHGLVDAIGGLSRAVAIAKLKANIPQDRQVTVVELSRPSPSLPEIFSGLGNSLVGVDRTLKELLQDLTFSHGVQARMDGIVFEKLEGYPYANPIFALMKDYLSSL
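
The CDS and Protein entries above can be compared by translating the nucleotide sequence. The following is a minimal sketch, assuming the standard genetic code (NCBI translation table 1); the `translate` helper is hypothetical and is not the tool used to produce this record. Whitespace is removed first, so a CDS wrapped across several lines is handled as one sequence.

```python
from itertools import product

# Standard genetic code (assumed: NCBI translation table 1), bases in TCAG order.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def translate(cds):
    """Translate a CDS into protein, stopping at the first stop codon."""
    # Remove spaces and line breaks, then normalise case.
    cds = "".join(cds.split()).upper()
    protein = []
    for i in range(0, len(cds) - 2, 3):
        aa = CODON_TABLE.get(cds[i:i + 3], "X")  # 'X' for unrecognised codons
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)


# First five and last four codons of the CDS shown above.
print(translate("ATGTCACGCACTCGC"))  # MSRTR
print(translate("AGTTCTCTGTAG"))     # SSL
```

Passing the full CDS text to `translate` and comparing the result with the Protein entry is a quick consistency check on the record.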